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[57] ABSTRACT 

A semiconductor random access memory chip wherein 
the cycle ume is less than the access time for any combi- 
nation of read or write sequence. The semiconductor 
random access memory chip is partitioned into rela- 
tively small sub-arrays with local decoding and pre- 
charging. The memory chip operates in a pipelined 
manner with more than one access propagating through 
the chip at any given time and wherein the cycle time is 
limited by sub-array cycles wherein the cycle time is 
less than the access time for a memory chip having 
cycle times greater than access times for accesses 
through the same sub-array. The memory chip also 
incorporates dynamic storage techniques for achieving 
very fast access and precharge times. 
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PIPEUNED MiEMORY CHIP STRUCTURE 
HAVING IMPROVED CYCLE TIME 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to semiconductor static 
and dynamic memory structures and more particularly, 
to a pipelined semiconductor memory chip divided into 
sub-arrays havmg globally and locally generated decod* 
ing and locally generated precharge signals. 

2, Background Art 

The present invention includes a number of particular 
techniques and structures which are related to general 
concepts found In the prior art. For example, the pres- 
ent invention employs a form of sub-array structure, 
uses multiplexed sense amplifiers and incorporates a 
precharge technique. 

Representative prior art references which describe 



"A 32b VLSI System". Joseph W. Beyers, et al, 1982. 
Digest of Technical Papers, 1982, IEEE International 
Solid-State Circuits Conference, pages 128-129, men- 
tions that a 128 Kb RAM is pipelined. 

SUMMARY OF THE INVENTION 

* An object of the present invention is to provide a 
semiconductor random access memory chip wherein 
the cycle time is less than the access time for any combi- 
10 nation of read or write sequence. 

Another object of the present invention is to provide 
a semiconductor random access memory chip grouped 
into a plurality of sub-arrays. 

A further object of the present invention is to provide 
a semiconductor random access memory chip that is 
partitioned into relatively small sub-arrays with local 
decoding and precharging. 

StiU another object of the present invention is to 
provide a semiconductor random access memory chip 



memories with sub-arrays» but not for pipelined opera- including relatively small memory snb-arrays which are 
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tion, include 

VS. Pat No. 4,569,036, issued Feb. 4. 1986 to Fujii et 
al, entitled SEMICONDUCTOR DYNAMIC MEM- 
ORY DEVICE; 

U.S. Pat No. 4,554,646, issued Nov. 19, 1985 to Yo- 
shimoto et al, entitled SEMICONDUCTOR MEM- 
ORY DEVICE; 

U.S. Pat No. 4,542,486, issued Sept 17, 1985 to 
Anami et al, entitled SEMICONDUCTOR MEMORY 
DEVICE; 

US. Pat No. 4.482.984. issued Nov. 13. 1984 to 
Oritani, entitled STATIC TYPE SEMICONDUCTOR 
MEMORY DEVICE; 

U.S. Pat No. 4,447,895, issued May 8, 1984 to Asano ,« 
et al, entitled SEMICONDUCTOR MEMORY DE- 
VICE; 

U.S. Pat No. 4,384,347, issued May 17, 1983 to 
Nakano, entitled SEMICONDUCTOR MEMORY 
DEVICE; 

U.S. Pat No. 4,222.112. issued Sept 9. 1980 to Clem- 
ens et al, entitled DYNAMIC RAM ORGANIZA- 
TION FOR REDUCING PEAK CURRENT. 

References in the prior art directed to multiplexed 
sense amplifier input techniques include 

U-S. Pat Na 4,511,997. issued Apr. 16, 1985 to 
Nozaki et al, entitled SEMICONDUCTOR MEMORY 
DEVICE; 

U.S. Pat No. 4,509,148. issued Apr. 2, 1985 to Asano 
et al, cntitted SEMICONDUCTOR MEMORY DE- 
VICE; 

U.S. Pat Na 4,477,739, issued Oct 16, 1984 to Pro- 
ebsting et al, entitled MOSFET RANDOM ACCESS 
MEMORY CHIP; 

U.S. Pat. No. 4,447,893, issued May 8, 1984 to 55 
Murakami, entiticd SEMICONDUCTOR READ 
ONLY MEMORY DEVICE; 

U.S. Pat No. 4,410,964, issued Oct 18, 1983 to Nor^ 
dling et al, entitled MEMORY DEVICE HAVING A 
PLURAIJTY OF OUTPUT PORTS. 

Descriptions of techniques using precharge signab 
cependeni upon a memory address are found m U.S. 
Pat No. 4.520,465, issued May 28, 1985 to Sood, enti- 
tied METHOD AND APPARATUS FOR SELEC- 
TIVELY PRECHARGING COLUMN LINES OF A 
MEMORY and U.S. Pat No. 4,513.372, issued Apr. 23, 
1985 to Zicgler et al. entitled UNIVERSAL MEM- 
ORY. 



operated in a pipelined manner witn more than one 
access propagating through the chip at any given time 
and wherein the cycle time is limited by sub-array cy- 
cles, p A still further object of the present invention is to 
provide a semiconductor random access memory chip 
wherein the cycle time is less tnan tne access time for a 
memory chip naving cycle times greater than access 
times for accesses through the same sub^array. 

Still another object of the present invention is to 
provide a semiconductor random access memory chip 
incorporating dynamic storage techniques for achieving 
very fast access and precharge times. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGS. l(fl) and 1(d) is a schematic illustrations of a 
256K semiconductor memory chip partitioned into a 
plurality of sub>arrays including bitswitches, sense am- 
plifiers, word line drivers and precharge circuits ac- 
40 cording to the principles of the present invention. 

FIG. 2 is a schematic illustration of a simplified depic- 
tion of a conventional 64K semiconductor memory chip 
including a plurality of macros according to the prior 
art. 

FIG. 3-1 is a schematic illustration of a simplified 
depiction of a semiconductor memory chip ncluding 
both a local precharge/reset technique and block ad- 
dress circuitry according to the principles of the present 
invention. 

FIG. 3-2 is a schematic illustration of a simplified 
depiction of a semiconductor memory chip similar to 
that of FIG. 3-1 including both a local precharge/reset 
technique and block address compare circuitry and 
further including a compare technique according to the 
principles of the present invention. 

FIGS. 4 and 5 are illustrations of timing diagrams 
useful in describing the operation of the semiconductor 
memory structure of the present invention. 
FIGS. 6 and 7(a) and 7(6) are block diagram illustra- 
60 tions of the pipeline segments in the access path of a 
semiconductor memory chip according to the princi- 
ples of the present invention. 

DISCLOSURE OF THE INVENTION 

Referring to FIG. 1, a schematic illustration, referred 
to in the art as a floor plan, is shown for a 256K bit 
embodunent of a CMOS semiconductor chip for a 
cache memory according to the present invention. 
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The particular embodiment of the 256K bit chip 
shown in FIG. 1 uses a second level metal layer to 
partition the chip into thirty-two 8K bit sub-arrays. 
Each sub^amy is organized as 128 word tines by 64 
bitiine pairs with 4-way bitswitches and 16 resistively 
decoupled, self-timed sense amplifiers which are lo- 
cated inboard, next to the sub<array because of the use 
of a second level metal layer. The structure uses stan- 
dard CMOS memory cells composed of six devices. 
The present invention may include, however, embodi- 
ments using single layer metal as well as three, four or 
more metal Uyeis. 

More specifically, the 2S6K bit chip structure of FIG. 
1 includes 32 sub-arrays arranged in 8 columns and 4 
rows. The abbreviadoos used in FIG. 1 refer to the 1^ 
following elements. 







cs 


Chip Select Not Input 








SA 
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Write BHiwitch 
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Local Write BiiSwitch 
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Features of the chip of FIG. 1 mclude a chip cycle 
time that is less than the access time, whOe also having 
a fast access time. This is accomptished by a number of 
techniques. 

3 One technique employed in FIG. 1 is that blocks in a 
critical path arc designed such that their active plus 
prechargc time is less than the access time of the chip. A 
key feature of the invention is that dynamic storage 
techniques are used to make it possible to achieve very 
10 fast access and precharge times. Also, a specific version 
of the known techniques of self-timing is used block-to- 
block and internally. 

To reduce word line delay, the chip of FIG. 1 is 
segmented into 8 local word lines with the global word 
lines on a first level metal layer and the local word lines 
on a polycide layer. 

The delay in developing data signals on the bitlines is 
reduced by segmenting the chip into 4 rows and by 
wiring the bitlines on a second level metal layer. 

The block select decoders and driver circuits are 
centered to reduce metal RC delays. 

Separate read and write paths are used with the wnte 
bitswitches placed at the opposite ends of the bitlines 
from the read bitswitches to minimize delay for both a 
23 read and write operation. 

The 256K SRAM bit chip u«ng the floor plan of 
FIG. 1 with sub-arrays is operated in a pipelined man- 
ner with more than one access propagating through the 
chip at any given time. In addition, the floor plan with 
iiiboard sense amplifiers b applicable to DRAM opera- 
tion with only a slight increase in access time with the 
restore portion of the cycle being hidden by me pipe- 
lined mode of operation as will be more fully described 
relative to the DRAM embodiment of FIG. 3.2. 

As previously stated, in the floor plan for a 2S6K 
SRAM shown in FIG. 1, the chip has been partitioned 
into 32 128 WLx64 BL sub-arrays by making use of 
second layer metal. The optimum sire and number of 
sub-arrays is influenced by chip access time require- 
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As shown in FIG. 1, each sub-array includes a sepa- 
rate read bitswitch, write bitswitch, bitiine precharge 
circuit, local word line driver and sense amplifier. Local 
word line and local read and write bitswitch decoder/- 

drivers are associated with each of the 32 sub-arrays. X 40 ments and array utilization. The second level of metal 
address amplifiers and Y address amplifiers are coupled also makes it practical to have inboard sense amplifiers 
to the word line and bitswitch decoder/drivers and for improvement of access time by reducing the loading 
block select decoder/drivers respectively, under con* on the output lines. Bitswitches are used so a sense 
trol of a clock signal generated froni the Chip Select amplifier can be shared between four bit lines, reducing 
Not Input. Data*In amplifiers provide inputs to each of 43 the loading on the sense*amp set signal, compared to 



the 32 sub-arrays under control of the clock signal and 
the write input 

The sense amplifiers associated with each of the 32 
sub-arrays are connected to data output lines via data- 
out latches and off-chip drivers. 

The sub-array arrangement illustrated in the embodi- 
ment of FIG. 1 includes local decoding and precharging 
and therefore, b operable in a pipelined manner with 
more than one a cce ss being capable of propagating 



having a sense-amp for each bitiine. The sense ampltfi- 
ers for each sub-array are self-timed locally and totally 
self-contained. 
Each of the sub-arrays in the new floor plan is essen- 
50 tially self contained, with its own localized word line 
driver, self-timed sense amplifier circuitry and pre- 
charge circuitry. During an access only a single sub- 
array is activated. Having only a small fraction of the 



chip (1/32 for the 256K example) accessed each cycle 
through the chip at any given time. The cycle time of ss has important ramifications for the design of a pipelined 
the chip b Umited by Uie sub-array cycle time.. FIG. 4 memory with more than one access propagating 
illustrates what is meant by cycle tune of the chip and through the chip at a given time, 
access time ofthe chip. Thus, chip access time (Tyfcci* In simplified form, a prior an memory chip consists 
T>icc2 etc.) is the time it takes, beginning with a given of a number of blocks or macros as shown in FIG. 2. 
chip to be selected, for the selected chip information to 60 During an access, data simply ripples from block to 



appear at the chip output. The chip cycle time is the 
selection repetition rate which indicates when, or how 
frequcndy a chip can be selected. Cycle time is desig- 
nated as 1,2.3 etc on the horizontal axis of FIG. ^ and 
is shown to be less than the access time. "Cycle time** 
throughout the following invention description applies 
to either the write operation or the read operation, in 
any order. 
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block with one block activating the next one and a 
global reset is used. In the prior art, as illustrated in 
FIG. 2, data "ripples'* because the data from the output 
of one block activates the next block (i.e., weU-known 
input triggering) but the blocks are "globally" reset by 
a signal generated by some other block which is fed 
back and resets a plurality of blocks, such as illustrated 
by the precharge reset connection in FIG. 2. 
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To achieve cycle time less than access time so the accesses, the probability of accessing one of the last 
RAM can be pipelined, a localized precharge is per- three sub-arrays accessed is 3/32, A compare on five of 
formed as shown in FIG. 3>1 as an improvement over the address bits is required each access. Thus, it is possi- 
prior .art global precharge as employed in FIG. 1 An ble for a memory chip with long sub-array precharge to 
example of the localized precharge is described In the 5 operate in a pipelined mode the majority of the time 
pablicatioD by B.A. Chappell et al, in the IBM Technt- with cycle time less than access time, 
cal t^losurc Bulleiin, Vol. 30, No. 7, dated December The systems implicadon ot a pipelined memory with 
1987 entitled "Self-Timed Pulsed Wordlme*'. Other cycle time less than access time can be understood by 
examples of input triggered, self-resetting circuits are considering the timing diagram of FIG. 4 and FIG. 5 
referred to in the art as address-tranststton-detection . 10 and the pipeline segment block diagrams of FIG. 6 and 
circuits. With the subdivided floor plan, the precharge FIG. 7, Two cases are considered. The first in FIG. 4 
signal can be generated locally and the loading on the and FIG. 6, assumes that the active plus precharge times 
precharge clock line is not large. The 256K design has of each block is less than i the access time. Therefore, 
only 8K bits of sub-array which must be precharged the chip can be pipelined with a cycle time of \ the 
each cycle. The sub-arrays can be considered as an 15 acce» time for both a read and a write. No comparisons 
array of chips with only one of them being acdvated are needed on incoming addresses. For this case the 
each selection. The sub-arrays with their own localized bandwidth of the chip is twice what it would be for a 
word line drivers, bitswitches, self timed sense amplifi- chip with access time = cycle time that is not pipelined, 
era and precharge circiuts are virtually independent The second case (FIG. S and FIG. 7) assumes that the 
chips. 20 active plus precharge time of the slowest block (the 

Additionally, each or the global blocks, externa! to sub-array) is twice' the access time and that all other 
the sub-array local circuitry, has self-timed precharge blocks are less than ) the access time. It is also assumed 
and reset circuitry, in other words, each block in the that comparisons are done on incoming address to 
critical path shown in FIG. 3-1 is switched into the check whether or not the access is to a sub-array ac- 
active state by tne previous blocks input signal, but is 25 cessed on one of the last three cycles. For the case 
returned to its precharge/standby state by self-con- where the access is not to one of these same sub-arrays 
tained circuitry. and there is no match on the compare, the chip will run 

Being able to precharge a block very quickly after it in a pipelined mode with a cycle time of i the access 
has performed its funcdon in antidpadon of the next time. If the access is to one of these same sub-arrays, 
access is a key requirement for a memory with cycle 30 there will be a match on the incoming address and the 
time less than access time. The minimum tune before cycle trme will be extended. Therefore, the bandwidth 
another access can be started is the active time plus the for this pipelined case compared to a chip that is not 
precharge time for the slowest block in the access path. pipelined but has the same access time and a cycle time 
The sub-array precharge, becaxise of the need to accu- of twice the access lime is 
rately equalize the bit lines, is difficult to accomplish in 3S 
a short period of time. Thus, the chip cycle time is 

limited by the sub-array cycle time. A six-device CMOS _ oa/ f i ( '^^^ -ac \ S\ 

cell, as used in the 256K SRAM. aUows the shortest Bw^BW^y^\^ j x 3 j 

cycle dme. 

The floor plan of FIG. 1 with inboard sense amplifi- 40 where 
m makes it possible to achieve almost the same access BW,=band width without pipelining 
tmie for a Dynamic Random Access Memory (DRAM) BW=:band width with pipelining 
array as an SRAM array. However, because of the need TNA^total number of accesses 
to restore the data in the accessed cell, it wili take con- AC^ accesses with compare, 
siderably longer for the precharge portion of the cycle. 45 accesses are random in nature, the bandwidth can 

In order to operate the chip of FIG, 1 in a pipelined gjygn by 
mode of operation for the cases where a long precharge 
is needed, initiadon of another access to the chip is 
permitted as long as that access is not to the same sub- 
array as- in the last three previous accesses. As shown in SO / \ \ ^ 
FIG. 3-2. thtt is accomplished by comparing the sub-. bw =; bw^ ( \ + [ ~ 3 j 
array selectioti bits with those of the previous three . k V / J 
-accesses. If the previous accesses are to different sub- 
arrays resulting in a no match with the compare func- where 

don, the new access would proceed while the previ- 55 Pc=probability of a compare 
ously accessed sub-array is being restored (thus "hid- NSA= number of sub-arrays, 
mg** the restore portion of the previously accessed sub- Thus for either random or sequential addresses one 
array as it b overlapped by the new access). For the should see almost a four times increase in bandwidth 
case where the compare found a match, the chip would compared to a conventional chip. For a DRAM, the 
go into a wait state until the sub-array precharge is 60 amount of time a chip is not available because of re- 
completed and the new access is initiated. freshing would be reduced by this same factor. 

By storing data from sequential addresses in different Thus, two approaches to the design of a pipelined 
sub-arrays, it is possible to minimize the probability of memory chip with cycle tune less than access time using 
an access to the last three accessed sub-arrays. For the a floor plan with sub-arrays have been described. The 
256K example given, there are 32 sub-arrays. Unless 65 first approach assumes the active plus precharge por- 
addresses were incremented by ^ word (32) increments dons of each block is less than the access time. In the 
the probability of returning on successive accesses to second approach, it is assumed that the active plus pre- 
the last three sub-arrays accessed is small. For random diarge portions of the sub-array block is greater than 



(I) 



BW = BWfil + (I - P() X 3) (2) 
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the access time, and that the active plus precharge por- 
tions of Che rest of the blocks is less than the access time. 
For both cases, a substantial increase in memory chip 
bandwidth is posable in memory systems using SRAM 
and DRAM chips. 3 

Having thus described our invention, what we claim 
as new, and desire to secure by Letters Patent is: 

1. A pipelined semiconductor 2" Kbit memory chip, n 
being an integer not less than 2, said chip being seg- 
mented into a plurality of 2" -J* memory sub-arrays of 2>* 10 
Kbits arranged in columns and rows on said chip, each 
one of said V-y memory sub-arrays includes a separate 
associated word line driver circuit means, sense ampli- 
fier circuit means and independent precharge circuit 
means connected thereto, each of said independent IS 
precharge circuit means of each of said segmented 
memory sub-arrays providing local self-timed reset and 
precharge function for each segmented memory array 
independent of said other of said plurality of 2"-^J^ mem- 
ory arrays, 

wherein said memory chip exhibits an access time t 
for providing data from said memory chip and 
wherein said local reset and precharge circuits of 
each of said segmented memory sub-arrays pro- 
vides a cycle time for each sub-array which is less 25 
than chip access time t 

X A pipelined semiconductor 2" Kbit memory chip 
according to claim 1 wherein said segmented memory 



sub-arrays mcluding independent local decoding and 
precharging means operate in a pipelined manner with 
greater than one access propagating through said 2" 
Kbit memory chip at one time. 

3. A pipelined semiconductor 2" Kbit memory chip 
according to claim 2 further including row and column 
address circuits disposed on said chip connected to said 
word line and bitswitch decoder/driver circuit means 
and responsive to input address access signals for select- 
ing ones of said 2" segmented memory sub-arrays for 
access. 

4. A pipelined semiconductor 2" Kbit memory chip 
according to claim 1 further including compare circuits 
for comparing at least two signals for determining said 
sub-arrays being accessed by separate access signals. 

5. A pipelined semiconductor 2" Kbit memory chip 
according to claim 1 wherein said memory chip exhibits 
an access time t. further including global blocks on said 

20 chip external to said sub-arrays each containing clock 
circuit means, address buffer means, row decoder 
means, data output bufiier means and word driver means 
associated with said plurality of sub-arrays, 
each of said global blocks including separate reset and 
precharge circuit means for providing a cycle time 
for each of said global blocks which is less than 
chip access time L 
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